Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization
نویسنده
چکیده
This paper presents a novel framework for unsupervised compensation of intra-session intra-speaker variability in the context of speaker diarization. Audio files are parameterized by sequences of GMM-supervectors representing overlapping short segments of speech. Session-dependent intra-session intra-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisance attribute projection (NAP) method. The proposed compensation method is evaluated in the context of speaker diarization in two-speaker conversations. A simple and effective twospeaker diarization algorithm is introduced in which speaker diarization is performed in the compensated supervectorspace. The proposed diarization algorithm was evaluated on summed telephone conversations and achieved a speaker error rate of 2.8% which is a 54% relative error reduction compared to a baseline BIC-based system. Finally, we evaluate the proposed system on a speaker recognition task in the summedspeech condition where improvement in speaker recognition accuracy is observed using the proposed diarization system.
منابع مشابه
Speaker Diarization Based on Gmm Supervectors and Unsupervised Intra-speaker Variability Modeling
This paper presents a novel framework for speaker diarization. Audio is parameterized by a sequence of GMM-supervectors representing overlapping short segments of speech. Session dependent intra-session intra-speaker variability is estimated online in an unsupervised manner, and is removed from the supervectors using Nuisance Attribute Projection (NAP) The supervectors are then projected using ...
متن کاملSpeaker Diarization using Unsupervised Compensation of Within-Speaker Variability
This paper presents a novel framework for unsupervised compensation of within-speaker variability in the context of speaker diarization. Audio session is divided into overlapping short segments, each one parameterized by a GMM-supervector. For each session independently within-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisance attribute projection ...
متن کاملIntra-session Variability Compensation for Speaker Segmentation
This paper addresses the problem of speaker segmentation in two speaker telephone conversations, proposing a segmentation approach based on factor analysis and a novel method for intra-session variability compensation to improve segmentation performance. The segmentation system is evaluated on the NIST Speaker Recognition Evaluation 2008 summed channel test condition, showing that intra-session...
متن کاملSpeaker Diarization in Personal Video Recordings Based on LDA and User Feedback
In this paper, we present the speaker diarization system which is used in personal video recordings. Speaker diarization begins by the extraction of relevant features from the input signal. Features are measurable characteristics which are important to the distinction between different classes. They should have low inter-class similarity and also low intra-class variability. So, LDA is used to ...
متن کاملExploiting Intra-Conversation Variability for Speaker Diarization
In this paper, we propose a new approach to speaker diarization based on the Total Variability approach to speaker verification. Drawing on previous work done in applying factor analysis priors to the diarization problem, we arrive at a simplified approach that exploits intra-conversation variability in the Total Variability space through the use of Principal Component Analysis (PCA). Using our...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010